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Abstract 

We consider a time division duplex (TDD) nt x rir multiple-input multiple-output (MIMO) system with 
channel state information (CSI) at both the transmitter and receiver. We propose X- and Y-Codes to achieve high 
multiplexing and diversity gains at low complexity. The proposed precoding schemes are based upon the singular 
value decomposition (SVD) of the channel matrix which transforms the MIMO channel into parallel subchannels. 
Then X- and Y-Codes are used to improve the diversity gain by pairing the subchannels, prior to SVD precoding. 
In particular, the subchannels with good diversity are paired with those having low diversity gains. Hence, a pair 
of channels is jointly encoded using a 2 x 2 real matrix, which is fixed a priori and does not change with each 
channel realization. For X-Codes these matrices are 2-dimensional rotation matrices parameterized by a single angle, 
while for Y-Codes, these matrices are 2-dimensional upper left triangular matrices. The complexity of the maximum 
likelihood decoding (MLD) for both X- and Y-Codes is low. Specifically, the decoding complexity of Y-Codes is the 
same as that of a scalar channel. Moreover, we propose X-, Y-Precoders with the same structure as X-, Y-Codes, but 
the encoding matrices adapt to each channel realization. The optimal encoding matrices for X-, Y-Codes/Precoders 
are derived analytically. Finally, it is observed that X-Codes/Precoders perform better for well-conditioned channels, 
while Y-Codes/Precoders perform better for ill-conditioned channels, when compared to other precoding schemes in 
the literature. 

Index Terms 

MIMO, TDD, precoding, singular value decomposition, condition number, diversity, error probability 

I. Introduction 

We consider time division duplexed (TDD) nt x rir multiple-input multiple-output (MIMO) systems, where 
channel state information (CSI) is fully available both at the transmitter and receiver Channels in such systems 
are typically slowly time varying and therefore subject to block fading. It is known that precoding techniques can 
provide large performance improvements in such scenarios. A popular precoding approach is based on singular 
value decomposition (SVD) HI, ^ of the channel so that the MIMO channel can be seen as parallel channels. 
Waterfilling schemes Q can be used to further improve the total capacity. 
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Another approach is based on channel inversion (CI) lIU, named as zero-forcing (ZF) precoding, which however 
suffers from a loss of power efficiency. Non-linear precoding such as Tomlinson-Harashima (TH) precoding IS), ||6l 
was exploited in Q. Linear precoders, which involve simple linear pre- and post-processing, have been proposed in 
im, m and references therein. Despite having low encoding and decoding complexity, the linear precoding schemes 
and the TH precoder have a low diversity order if all the modes of transmission are utilized (i.e., full-rate). Diversity 
order can be improved, only by transmitting over a subset of all possible modes of transmission, but this results 
in a rate loss. Precoders based on lattice reduction techniques IfTOl and vector perturbation IfTTII can achieve full 
rate and high diversity, but at the cost of high complexity. We therefore see a tradeoff between rate, diversity and 
computational complexity. This motivates us to design precoding schemes which achieve both full rate and high 
diversity at low complexities. 

In this paper, we consider SVD precoding for MIMO system, which transforms the MIMO channels into parallel 
subchannels. At the receiver, maximum likehhood decoding (MLD) can be employed separately for each subchannel. 
To improve the low diversity order of the SVD precoded system, we propose some simple linear codes prior to 
SVD precoding. These codes are named X- and Y-Codes due to the structure of the encoder matrix, which enables 
to flexibly pair subchannels with different diversity orders. Specifically, the subchannels with low diversity orders 
can be paired together with those having high diversity orders, so that the overall diversity order is improved. The 
main contributions in this paper are: 

1) X-Codes: A set of 2-dimensional (2-D) real orthogonal matrices is used to jointly code over pairs of 
subchannels, without increasing the transmit power Since the matrices are effectively parameterized with 
a single angle, the design of X-Codes primarily involves choosing the optimal angle for each pair of sub- 
channels. The angles are chosen a priori and do not change with each channel realization. This is why we 
use the term "Code" instead of "Precoder". Further optimization of angles are based upon minimizing the 
average error performance. At the receiver, we show that the MLD can be easily accomplished using iir low 
complexity 2-D real sphere decoders (SDs) ifTSl . It is shown that X-Codes have better error performance 
than that of other precoders, yet it becomes worse when the pair of subchannels is poorly conditioned (for 
definition, see Theorem 2). This motivates us to propose the Y-Codes. 

2) Y-Codes: Instead of using rotation for pairing subchannels, we use a linear code generator matrix which is 
upper left triangular Y-Codes are parameterized with 2 parameters corresponding to power allocated to the 
two subchannels. These parameters are computed so as to minimize the average error probability. The MLD 
complexity is the same as that of the scalar channels in linear precoders (S), ||9l and is less than that of the 
X-Codes, while the performance of Y-Codes is better than that of X-Codes for ill-conditioned channel pairs. 

3) X-, Y-Precoders: The X- and Y-Precoders employ the same pairing structure as that in X-, Y-Codes. However, 
the code generator matrix for each pair of subchannels is chosen for each channel realization. We observed 
that the error performance of X- and Y-Precoders is better than that of X- and Y-Codes. 

A precoding scheme, named E-dmin, has been recently proposed in |12|. The structure of its precoding matrix 
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is similar to that of X-Codes. However, the E-dmin precoder is only optimized for 4-QAM symbols, but suffers 
from loss in power efficiency with higher order modulation. The performance and decoding complexity of E-dmin 
are worse when compared to the proposed X-, Y-Codes. 

The rest of the paper is organized as follows. Section introduces the system model and SVD precoding. In 
Section we present the pairing of subchannels as a general coding strategy to achieve higher diversity order in 
fading channels. In Section |IV] we propose the X-Codes and the X-Precoders. We show that ML decoding can be 
achieved with rir 2-D real SDs. We also analyze the error performance and present the design of optimal X-Codes 
and X-Precoders. In Section|V] we propose the Y-Codes and Y-Precoder. We show that they have very low decoding 
complexity. We analyze the error performance and derive expressions for the optimal Y-Codes and Y-Precoders. 
Section |VT] shows the simulation results and comparisons with other precoders. Section lVlIl discusses the complexity 
of the X-, Y-Codes/Precoders in comparison with other precoders. Conclusions are drawn in Section IVIIII 

Notations: Superscripts ^, ^, and * denote transposition, Hermitian transposition, and complex conjugation, 
respectively. The n x n identity matrix is denoted by I„, and the zero matrix is denoted by 0. The E[-] is the 
expectation operator, || • || denotes the Euclidean norm, and | • | denotes the absolute value of a complex number The 
fields of complex numbers, real numbers and non-negative real numbers are C, M and M+, respectively. Furthermore, 
[cj denotes the largest integer less than c. Finally, we let and denote the real and imaginary parts of a 
complex argument. 

II. System model and SVD precoding 

We consider a TDD nt x rir MIMO (n^ < rit), where the channel state information (CSI) is known perfectly at 
both the transmitter and receiver. Let x ~ {xi, . . . ,a;„J-^ be the vector of symbols transmitted by the rit transmit 
antennas, and let H = {hij}, i = 1, . . . , rir, J = 1, • • ■ , ?^t, be the Ur x rit channel coefficient matrix, with hjj as 
the complex channel gain between the j-th transmit antenna and the i-th receive antenna. The standard Rayleigh 
flat fading model is assumed with hij ~ Afc{0, 1), i-c, i.i.d. complex Gaussian random variables with zero mean 
and unit variance. The received vector with symbols is given by 

y = Hx + n (1) 

where n is a spatially uncorrelated Gaussian noise vector such that E[nnt] — iVoI„^. Such a system has a maximum 
multiplexing gain of n^. 

Let the number of transmitted information symbols be Ug (n^ < rir). The information bits are first mapped 
to the information symbol vector u = (ui, . . . ,u„J-^ G C"% which is then mapped to the data symbol vector 
z = (zi, . . . , Zn^)'^ e C"= using a Us X Us matrix G. 

z = Gu + u" (2) 

where u*' G C"' is a displacement vector used to reduce the average transmitted power. 

Let T be the nt x Ug precoding matrix which is applied to the data symbol vector to yield the transmitted vector 

X = Tz (3) 
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In general T,G and Uq are derived from the perfect knowledge of H at the transmitter and they are crucial to 
the system performance and complexity. The transmission power constraint is given by 

E[||x||2] = Pj, (4) 

and we define the signal-to-noise ratio (SNR) as 

^ No 

The proposed X-, Y-Codes are based on the SVD preceding technique, which is based on the singular value 
decomposition of the channel matrix H = UAV, where U £ C"-''"-, A e C"-^"-, V e C"-^"', and UUt = 
VVt = I„^, and A = diag(Ai, . . . , A„ J, with Ai > A2 • • • > A„^ > 0. 

Let V e C"sX"t be the submatrix with the first Ug rows of V. The SVD precoder uses 

T = V\ 

u" = (5) 

and the receiver gets 

y = HTu + n (6) 

Let U G C"r^x"s be the submatrix with the first Us columns of U. The receiver computes 

r = UV = Au + w (7) 

where w G C"° is still an uncorrected Gaussian noise vector with E[ww'l'] = NqIu,, A = diag(Ai, A2, • • • A„J, 
and r = (ri, . . . , r„J^. The SVD preceding therefore transforms the channel into Ug parallel channels 

n^XiUi+Wi i = l,...,ns (8) 

with non-negative fading coefficients A^. The overall error performance is dominated by the minimum singular 
value A,i,. When lis = n,. ~ rit, the resulting diversity order is only 1. 

III. Pairing good and bad subchannels 

Without loss of generality, we consider only the full-rate SVD preceding scheme with even n,. and Us = n,.. The 
matrix G G C""^^"' is now used to pair (jointly encode) different subchannels in order to improve the diversity 
order of the system. The preceding matrix T G C"*^"'' and the transmitted vector x are given by 

T = V^ x = Vt(Gu + u°) (9) 

Let the list of pairings be {ik,jk) € [1, n-r], k G [1, nr/2] and ik < jk- On the fc-th pair, consisting of subchannels 
ik and jk, the information symbols and Uj^ are jointly coded using a 2 x 2 matrix Ak- In order to reduce the 
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ML decoding complexity, we restrict the entries of to be real valued. Each = {ak,i,j}, i,j £ [1,2], is a 
submatrix of the code matrix G as shown below. 



(10) 



where gij is the entry of G in the i-th row and j-th column. 

We shall see later, that an optimal pairing in terms of achieving the best diversity order is one in which the fc-th 
subchannel is paired with the (n^ — fc + l)-th subchannel. For example, with rir = 6, the X-Code structure is given 
by 

"^1,1,2 

02,1,1 ^^2,1,2 
as, 1,1 03,1,2 

03,2,1 03,2,2 

02,2,1 02,2,2 
Ol.2,l Oi 2,2 



G = 



(11) 



and the Y-Code structure is given by 







G 



01,1,1 



01,1,2 



02,1,1 02,1,2 
03,1,1 03^12 
03,2,1 

02,2,1 



01.2,1 



(12) 



Let 



A r iT 



Due to the transmit power constraint in (|4|, and uniform power allocation between the nr/2 pairs, the encoder 
matrices A^ must satisfy 

2Pt 



E[||AfeU, + u°|| 



rir 



(13) 



The expectation in ( fT3T l is over the distribution of the information symbol vector u^. is the subvector of the 
displacement vector u*^ for the fc-th pair 

The matrices A^ for X- and Y- codes can be either fixed a priori or can change with every channel realization. 
The latter case leads to the X-, Y-Precoders. 



The names X- and Y-Codes are due to the structure of the code generating matrices in jilt and I12t . 
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A. ML Decoding 

Given the received vector y, the receiver computes 

r = UV - Au° (14) 

Using ([U and (|9]l, we can rewrite (fT4l) as 

r = AGu + w = Mu + w (15) 

where M = AG is the equivalent channel gain matrix and w = U^^n is a noise vector with the same statistics as 
n. Further, we let 

Let Mfe e K^^^ denote the 2x2 submatrix of M consisting of entries in the ik and jk rows and columns. Then 
( fTsT i can be equivalently written as 

rir 

rfe = MfcUfe + Wfe, k = l,...,— (16) 

3fi(ufc) e iSfc, where 5^ is a finite signal set in the 2-dimensional real space . 

Assuming that the same set is used for the imaginary component, the spectral efficiency 77 is given by 



7? = 2glog2(|5fc|) (17) 



k=l 



From ( fT6] l, it is clear that MLD reduces to separate MLDs of the k pairs, which can be further separated into 
independent ML decoding of the real and imaginary components of u^. Then the MLD for the k-th pair is given 
by 



3fJ(ufe)=arg min P(rfc) - Mfc3?(ufe)|r (18) 

sR(ufc)e5fc 



and 



3(ufe)=arg min ||3(rfc) - Mfc3(ufe)f (19) 
where Ufe is the output of the ML detector for the k-th pair. 

B. Performance Analysis 

Let Pk denote the word error probability (WEP) for the fc-th pair of sub-channels with the ML receiver. The 
overall WEP for the transmitted information symbol vector is given by 

p = 1 - n (1 - Pk). (20) 

k=l 

From ( fTsT i and ( fT9] l. we see that WEPs for the real and the imaginary components of the k-th pair are the same. 
Therefore, without loss of generality we can compute the WEP only for the real component, denoted by Pf., and 
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then Pfc = 1 — (1 — P/.)^- Let us further denote by Pj.(5R(ufe)) the probability of the real part of the ML decoder 
decoding not in favor of 3fJ(ufe) when is transmitted on the fc-th pair P^. can then be expressed in terms of 
Pfc(3?(ufc)) as follows 

= ^ E P'km^k)) (21) 

where Pj.(5ft(ufe)) has to be evaluated differently for X-, Y-Codes and X-, Y-Precoders. To explain this difference 
we need the following definitions. 

For a given channel realization, i.e., deterministic value of A^^. and A^,. for the fc-th pair, we let 
P^(3?(ufc), Ai^ , Aj^. , A/;) be the error probability of MLD for the real component of the fc-th channel, given that 
the information symbol Ufc was transmitted on the fc-th pair. For X-, Y-Codes, the matrices are fixed a priori 
and are not function of the deterministic value of channel gains, and therefore, P^{^{uk)), is given by 



Pfe(KK)) =E,,^^^ [Pj3?(u,),A,,,A,,,Afe) 



(22) 



We observe that Pj,(5R(ufe)) is actually a function of A^ and therefore the optimal error performance is obtained 
by minimizing d^TT i over A^. Then the optimal matrix for the fc-th pair is given by 



A°P* = argmin '^(m -a, ) ^'ki^i'^k), Xi, , Xj,, A^) 

A I- ' '■k -Ik . 

SR(ufc) 



(23) 



The minimization in ( |23] l is constrained over matrices A^ which satisfy ( fT3] ). The optimal error performance Pfe°''* 
is given by 



p^°'"=w-\ E %...A,,., k(5RK),A,„A,„Ar) 



(24) 



For the X-, Y-Precoder, the matrices A^ are chosen every time the channel changes. For optimal performance. 



the matrices A^ are chosen so as to minimize the error probability for a given channel realization Let A?''*, the 



optimal encoding matrix for the fcth pair is then given by 



ATiK^K) = argmin ^ P^C^RK), A., , A,, , A,) (25) 



K(u, 



The optimal error performance for X-, Y-Precoders is therefore given by 



P^'"" = W-\ E k(5RK),A.,,A,,,Ar(A.,,A,J) 



(26) 



Comparing (|26] | and ( |24] |. we immediately observe that the optimal error performance of X-, Y-Precoders is better 
than that of X-, Y-Codes. Our next goal is to derive an analytic expression for Pj.(3fi(ufc)). We shall only discuss 
the derivation for X-, Y-Codes, since the performance of X-, Y-Precoders is better than X-, Y-Codes and therefore 
have at least as much diversity order as X-, Y-Codes. Getting an exact analytic expression is difficult, and therefore 
we try to get tight upper bounds using the union bound. 
Theorem 1: The upper bound to Pj, is given by 



Pk<Ck{\Sk\-l){ ^' +0(7-"") (27) 
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where 

4 = {nt - ik + l){nr - ik + 

A C{ik)ii2dk~l) ■■■ 5- 3-1) 

= ^k 

where gk{Ak) is the generalized minimum distance, as defined in (|63]) (See Appendix [All. C(m) (1 < to < 

min(nr,nt)) is defined in ifTTl . 

Proof - See Appendix lAl ■ 

Let us define the overall diversity order 

A -logP 



5ord ^ lim 



Then, it is obvious that 



7^oo log ^ 



<5ord > minJfc. (28) 



This bound also holds for the X-, Y-Precoders, since the error performance of the X-, Y-Precoders is always better 
than that of X-, Y-Codes. 

C. Design of Optimal Pairing 

From the lower bound on So^d (given by ( |28] l) it is clear that the following pairing of sub-channels 

ik = k, jk = Ur - k + 1. (29) 

achieves the following best lower bound 

Sord>{^ + l){nt-'-^ + l) (30) 



Remark 1: Note that this corresponds to a cross-form generator matrix G, and is not the only pairing for the 
best lower bound. Also we note that the diversity order improves significantly, when compared to the case of no 
pairing. It can be shown that, if only Us (us is even) out of the subchannels are used for transmission, the lower 
bound on the achievable diversity order is [rir — ^ + ^ + 1). ■ 

Although it is hard to compute A^.^*, we can compute the best A^, denoted by A^, which minimizes the upper 
bound on Pk in ( |27] i. Then we have 

Afc = arg max — - — (31) 

A,|E[||A,u,+u«p]=ia: 

Using (|27l), (l29| and (|3B, we obtain 

P'k<Ck{\Sk\^l){^^y'Koi,~^l) (32) 
where 5^ = {rit — k + l){nr — fc + 1). 
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IV. X-CODES AND X-PRECODERS 

A. X-Codes and X-Precoders: Encoding 

For X-Codes, each symbol in u takes values from a regular M^-QAM constellation, which consists of the M- 
PAM constellation S = {T{2i — {M — 1)) |i = 0, 1, • • • {M — 1)} used in quadrature on the real and the imaginary 
components of the channel. The constant 

A I 3^^ 



and 



2{AP - 1) 



rir 



is the average symbol energy for each information symbol in the vector u. Gray mapping is used to map the bits 
separately to the real and imaginary component of the symbols in u. We fix u° to be the zero vector. In order to 
avoid transmitter power enhancement, we impose an orthogonality constraint on each A^, and parameterize it with 
a single angle Ok- 

cos{dk) sm{9k) 
-sin(6'fc) cos{6k) 

We notice that 1) both A^ and G are orthogonal; 2) for X-Codes we fix the angles 9k a priori whereas for the 
X-Precoders we change the angles for each channel realization. 



1, 



. n,./2 



(33) 



(34) 



B. X-Codes and X-Precoders: ML Decoding 

From (fTsT l and ( fT9l ) it is obvious that two 2-D real SDs are needed for each pair. Since there are ^ pairs, the 
total decoding complexity is rir 2-D real SDs. For X-Codes, the matrices in ( fTSl l and ( fT9] ) are given by 

-Xj^smidk) Xj^^cos{9k) 

C. Optimal design of X-Codes 

In order to find the best angle 9k for the fc-th pair, we attempt to maximize gk{Ak) under the transmit power 
constraints. For X-Codes, let Zk = 5R(u/c) — 5R(vfc) be the difference vector between any two information vectors, 
which can be written as 



6E, 



(A/2 - 1) 



M 



where 



= {{P, q)\0 < \p\ < {M - 1), < \q\ < (M - 1), {p, q) ^ (0, 0)} 
As defined in Appendix |A] the generalized pairwise distance is 

6PTfpcos(0fe)+gcos(0fc);^ 



d^(5RK),5R(vfc),Afc) 



nr{A'P - 1) 



(35) 
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Since is parameterizable with a single angle 6*^, let 



Also let 

Using dSST l and the (l63l l. we have 



gkiek,M) = gk{Ak) 



<^p,9 = tan I - 



n^(M2 - 1) 

Using dSTI ). the best ^/j, denoted by 6*^, is given by 



91 — arff max min (p + q ) cos (0^ — 09„ (36) 



Following ( |32] ). the best achievable upper bound for Pj, is 



(37) 



Remark 2: It is easily shown by the symmetry of the set §m that it suffices to consider Ok G [0, j] for the 
maximization in ( |36] |. The min-max optimization problem does not have explicit analytical solutions except for 
small values of M, for example M = 2. But since the encoder matrices are fixed a priori, these computations can 
be performed off-line. ■ 
For small MIMO systems (for example 2x2 MIMO) it is possible to get a tighter upper bound by evaluating the 
expectation in (1611 1 (See Appendix A). Pi is then upper bounded as follows. 

P' < V (70/81)(M2-l)V4 
^ - ^ AP(pcos(ei) + qsm(diW{p^ + q^) ' ^ ^ 

(p,q)eSM 

where 9i is the angle used for the only pair. For larger MIMO systems it is preferable to use the inequality in 
( |62] | (See Appendix A), since evaluating the expectation containing two singular values is tedious. In Fig. [l] we 
compare the word error probability of a 2 x 2 MIMO system with that given by ( [38] l. and observe that the union 
bound is indeed tight at high SNR. 

In Fig. |2] we plot the variation of the upper bound to the WEP w.rt. the angle 9i for the 2x2 MIMO system 
with 4-QAM and 16-QAM modulation. We observe that WEP is indeed sensitive to the rotation angle. With 4- 
QAM modulation, the WEP worsens as the angle approaches either or 45 degrees. With 16-QAM modulation, 
the performance is even more sensitive to the rotation angle. Moreover we observe that the performance is poor, 
when the angles are chosen near 18.5, 26.6 and 33.7 degrees, corresponding to (ps.i, </32,i, and (p3.2, respectively. 
From ( |6T] l. it is clear that the performance at high SNR is determined by the minimum value of the distance 
||Mfe(3fi(ufc)-3fi(vfe))||2, which is 
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when {p, q) takes values over the set §>nj. If 9^ = tan^^{—p/q) for some {p, q) E §j\/, then the minimum distance 
is independent of A^^ and depends only upon A^,,. This implies a loss of diversity order since the diversity order of 
the square fading coefficient A^^ is less than that of A|^ . For the case of rir = rit ~ 2, this would mean a reduction 
of diversity order from 4 to 1. The set and the critical angles are illustrated in Fig. [3] 



D. Optimal design of X-Precoder 

For X-Precoders, the optimal rotation angle is tedious to compute due to lack of exact expressions for error 
probability. Just like X-Codes we resort to bounds on error performance. It is possible to get union bound expression 
for the error probability of the k-th pair. However, we do not further upper bound the union bound by using ( l62b . 
since by doing so we would have lost information about Xj^. Instead in the pairwise sum, we look for the term 
with the highest contribution to the union bound and try to minimize this term. The best angle for the fc-th pair is 
then given by. 



SkiKkAjk) = arg max min dl{p,q,9k) 

efc6[0,27r] {p.,q)eSM 

= arg max min d\{p,q,9k) 
efce[o,f](p,g)GSM 



where 



dl{p.q,ek) 



(39) 



(40) 



Just like for X-Codes, it can be shown that for the maximization in ( [39] ), it suffices to consider the range [0, |-] 
for 9k- The optimization problem in ( [39] l is difficult, but can be solved exactly for small values of M. Also, the 
minimization over (p, q) G §a/ need not be over the full set containing \E>m\ ~ 4:M{M — 1) elements. In fact it can 
be shown that the number of elements to searched is at most (il/^ — 3il/ + 6)/2. Therefore, for M = 4 (16-QAM), 
we need not search over the full set of 48 elements, but rather it suffices to search over only 5 elements. 

Theorem 2: For il/ = 2 (4-QAM), the exact ^fe(Ai^, A^J is given by, 

7r/4 f3k<V3 



tan 



where 

Proof - See Appendix iBl 
Further let 



Pk — T — • 



(41) 



d'k,mini^ik ' ^jk) 



max min df.(p,q,9k) 
efce[o,f](p,9)eSM 



Using ( |42] |. the union bound to P^, is given by 

P'k < (M^ - 1)E 




dl 

/c,min 



i^ik 1 ^jk 



2Nn 



(42) 



(43) 



The expectation in ( l43T l is over the joint distribution of (A^^ , Ajj. ) and is difficult to compute analytically. We 
therefore use Monte-Carlo simulations to evaluate the exact error probabiUty. 
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V. Y-CODES AND Y-PRECODER 

A. Motivation 

It is observed that the error performance at high SNR is dependent on the minimum value of the 
distance d^(3?(ufc), 3fi(vfc), Afc) over all possible information vectors ^ v^. Using the definition for 
cJ^(3?(ufc), 3?(v/j), Afe) (see Appendix lAt we have 



4(5R(ufc),5R(vfc),Afc) = ||Mfc(5RK)-5R(vfe))| 



AleL + AleL (44) 



where et = Afc(5R(ufe) - 5R(vfc)). 

Let Pk be the condition number of the equivalent channel for the fc-th pair (see Theorem |2]i. We have > 1 
since Ai^ > A^,.. For the special case of /3k = 1, (i^(3?(ufc), 3fi(vfc), Afc) is proportional to ||efe|p, which is the 
Euclidean distance between the code vectors. Therefore, the design of good codes is independent of the channel 
gain. In such a scenario, it is known that for large M choosing the code vectors as points of the 2-dimensional 
hexagonal lattice would yield codes with good error performance. However, the design of good codes becomes 
difficult for values of > 1. We immediately notice, that the effective Euclidean distance in (l44l i gives more 
weight to el i, which is the difference of the vectors along the first component (since A^^, > Xj^). Since the total 
transmit power is constrained, codes should be designed such that the minimum separation of any two code vectors 
is more along the first component. 

For X-Codes and X-Precoder, the minimum separation was increased by rotating the QAM constellation by 
an optimal angle. However, with this approach, apart from gaining separation along the first component, we also 
achieve separation along the second component. It is noted that the same diversity order can be achieved even if the 
minimum separation along the second component is small. Since the average transmit power is constrained, optimal 
code design would try to choose code vectors such that for the same transmit power, more separation is achieved 
along the first component (without caring much about the separation achieved along the second component). This 
observation along with the motivation of further reducing the decoding complexity leads to the design of Y-Codes 
and Y-Precoders. 

B. Y-Codes and Y-Precoders: Encoding 
The matrices A^ have the structure 



Afc = 



2bk 

where ak,bk G K^. Let Sk be the set of pairs of integers defined by the Cartesian product 



(45) 



Sk = i \0, 11 X 



M 



For example, with M = 4, the set Sk is given by 

5fc = {[0,0]^, [0,lf, [l,Of, [l,lf} (46) 
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We consider the 2-D codebook of cardinaUty M generated by applying to the elements of Sk- The code 
vectors Yk{v), v = 1, . . . , M, are given by 



Yk{v) = 



The real and imaginary components of the displacement vector for the fc-th pair, u° are given by 



'■k 

T 



{M - l)ak , 

- , -Ofc 



(47) 



(48) 



2 

Due to the transmit power constraint in dlj] ). ak and bk must satisfy 

bl + al—— = — (49) 
12 rir 

Information bits are Gray mapped to codebook indices in such a way. Hence the Hamming distance between bit 
vectors corresponding to close by code vectors is as small as possible. The only difference between Y-Codes and 
Y-Precoders is that, for Y-Codes, the parameters Uk and bk are fixed a priori, whereas, for the Y-Precoders, these 
are chosen every time the channel changes. 

C. Y-Codes and Y-Precoders: ML Decoding 

Using our codebook notation, the ML decoding rule in ( fTSl ) and ( fT9b . can be equivalently written as 

v['^ = arg mill \\^irk) - AkYk{v)f (50) 

v£{0,--- ,(M-1}} 

vi"^^ = arg min ||3(r,)-AfenW||' 

■u6{0,--- ,(-"-'—1)} 

where and ^j^.*^-* are ML estimates of the codeword indices transmitted on the real and imaginary components 
for the fc-th pair. 

We next discuss a low complexity algorithm for the optimization problem in dSOt . The algorithm is the same for 
all pairs, and the same for both the real and imaginary components of each pair. Therefore, we only discuss the 
algorithm for the real component. We first partition the 2-D received signal space (M^) into + l) regions as 
follows. 



Ro 



Ri 



Ri 



[x,yfe R^l - oo < 
[x,yf£ 'M?\{M - 1) < 
[x,yf&9?\{2i-l)< 



M-l 



< 1 



X 



A/ - 1 



Kf, ak 2 
X M - 1 



< +00 



< (2i + l) 



(51) 



where i e [l,Af/2- 1]. 

In Fig. m we illustrate the 5 regions with M = 8 for the real component of the fc-th pair We next discuss a low 
complexity ML decoding algorithm for Y-Codes. The first step of the decoding algorithm is to find the region to 
which the received vector belongs. Let 

tk 



3?(r,J , M + l 
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The received vector belongs to the region , where (^k is explicitly given by 



tfc < 



< 



M . ^ M 



(52) 



tk otherwise 



For example, in Fig. |4] the received vectors pi, p2, and p3 belong to R^, Ri, and R^ respectively. It can be 
shown that, once the received vector is decoded to the region i?^^. , the ML code vector is one among a reduced set 
of at most 3 code vectors. Therefore, at most 3 Euclidean distances need to be computed to solve the ML detection 
problem in ( fSOl l, as compared to computing all the M Euclidean distances in case of a brute force search. For 
example, in Fig. [H for the received vector pa e R^, the ML code vector is among 1^(6), yfc(7) or 1^(8). 

However, once we know the region of the received vector, it is possible to directly find the ML code vector 
even without computing the 3 Euclidean distances. This involves just checking a few linear relations between the 
2 components of the received vector Therefore, the ML decoding complexity of Y-Codes is the same as that of a 
scalar channel. For example, in Fig. |4] the received vector pa is to the right of the perpendicular bisector between 
1^(6) and Yk{8). The vector p3 is also above the perpendicular bisector between 1^(7) and Yk{8). From these two 
checks, it can be easily concluded that the ML code vector is Yk{8). Due to the structure of the codebook, the ML 
decision regions can be very easily outlined. In Fig. |4] the dotted lines demarcate the boundary of the ML decision 
regions. The hatched region illustrates the ML decision region of 1^(5). 

D. Optimal design of Y-Codes 

Given the optimal pairing in ( |29] l, the next step towards designing optimal Y-Codes is to find the optimal value 
of (afcjbfc), which minimizes the average error probability. For Y-Codes, once chosen, {ak,bk) are fixed and do 
not change with every channel realization. Since the ML decision regions are known precisely, it is possible to 
calculate the exact error probability. With our new codebook notation, we identify code vectors by their index in 
the codebook, and given by 




(53) 



V 



where P^iv) is the probability of error when the code vector Yk{v) is transmitted. is given by 
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The functions gi{ak,bk), g2{ak,h) and gsiak^bk) are given by 

gi{ak,bk) = 1- / -==Q{^k{x))dx 
Jo Vt^Nq 

g2{ak,bk) = I- -^^=Q{-^k{x))dx 
J-oo VttA'o 

To compute the optimal {ak,bk), we have to minimize Pj, w.r.t. {ak,bk) subject to the transmit power constraint 
in ( |49] l. However, it is difficuh to get closed form expressions for the optimal {ak,bk) due to the intractability 
of the integrals in ( |54| ). This difficulty is further compounded due to the evaluation of expectation over the joint 
distribution of (A^j. , Xj^). However, since {uk, bk) are fixed a priori, it is always possible to approximately compute 
the optimal {ak,bk) off-line, using Monte-Carlo techniques. 

E. Optimal design of Y-Precoder 

For the Y-Precoder, finding the optimal {ak,bk) for each channel realization is again difficult due to the 
intractability of the integrals in ( |54] i. In the case of Y-Codes these could be computed offline since {ak,bk) are 
fixed a priori. However, for Y-Precoders these cannot be computed offline, since the optimal {ak,bk) have to be 
computed every time the channel changes. Therefore, we try to optimize {ak,bk) by minimizing the union bound 
for Pj,. The union bound is given by 



Pk < {M - 1)E 



Q 



(55) 



2iVo 

where the expectation is over the joint distribution of (A^^. , A_,,_ ) and 

4,ninK, ^ min (a?,4(z; - wf + A,\6^((-l)" - (-1)-)^) (56) 

where v and w are distinct indices of the codebook. 

The optimal choice of (afe, bk), denoted by (a^., b^), which maximizes d\ ,„in(afc, bk) for the fixed channel gain 
of (Aifc,AjJ, is given by 

Theorem 3: The optimal value of (0^,6^) defined as 

(4,6fc)=arg max , c'fc.minlafc- ^fe) (57) 



is given by 



{ (afc ,6*, ) 6 (R+ ) ^ I f.| HipL = ijt } 



M-'-l 



{al,bl)^{ (58) 

, \/ 3rv 031+ A f)' \f^^(^+W) ) Pl<^^Y^ 



where M' = 



I _ M-'-l 
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The corresponding optimal value of df. 



{ak,bk) is given by 




3 



4,min(aL&fc) = ' 



(59) 




3 



n.(3/3^ + 



) 



3 



Proof - See Appendix ICl 



If we now look back at the codebook for Y-Precoders, we notice that there is power allocation on the 2 channels 
through the parameters ak and b^, which can be chosen optimally based upon the knowledge of channel gains. 
From (ISST l. we observe that the Y-Precoders use only the first channel (the better channel), when channel condition 



channel condition. This adaptive nature of the Y-Precoders enables it to achieve better error performance in badly 
conditioned channels. Y-Codes also have a fixed-rate allocation between the two channels of a pair, since out of the 
log2(Af ) bits, one bit can be used to decide whether the vector in the codebook is at even index (corresponding to 
the second component equal to +bk) or at odd index (corresponding to the second component equal to —bk)- The 
remaining bits are then used to appropriately choose among the vectors at even or odd indices. Therefore, in a way, 
the proposed Y-Codes always transmits 1 bit of information on the bad channel and log2(Af) — 1 bits on the good 
channel. This rate allocation may not be the best and therefore even better code books can be constructed. One more 
aspect that is important is the decoding complexity which for the proposed scheme is low and is independent of 
M. It would be challenging to obtain good code books with variable rate allocation and low decoding complexity. 
We however do not address this problem in this paper 



In this section, we compare the performance of X-, Y-Codes and X-, Y-Precoders with other precoders. For all the 
simulations we assume rir = rit. The optimal matrices A^, are chosen as discussed previously. Comparisons are made 
with 1) the E-dmin (equal dmin precoder proposed in fT2l ). 2) the Arithmetic mean BER precoder (ARITH-MBER) 
proposed in ID, 3) the Equal Energy linear precoder (EE) based upon optimizing the minimum eigenvalue for a 
given transmit power constraint Q), 4) the TH precoder based upon the idea of Tomlinson-Harashima precoding 
applied in the MIMO context Q) and 5) the channel inversion (CI) known as Zero Forcing precoder ||4l. 

A. Effect of channel condition on error performance 

In Fig. |5] we plot the error performance of all precoding schemes for a 2 x 2 MIMO system at 7 = 26 dB, as 
a function of the condition number /? = A1/A2. We fix the total channel gain to be 1, i.e., Ai^ + A2'^ = 1, and 
the target spectral efficiency to be ?/ = 8 bps/Hz. We briefly discuss the precoding schemes which are compared to 
the proposed X-, Y-Codes. ARITH-MBER transmits Ug symbols, each from a QAM modulation alphabet. When 
Us — 1, 256-QAM modulation (16-PAM on the real and imaginary component) is used on the first component of 
the code vector and the second component is not used for transmission. When Ug = 2, 16-QAM modulation is used 



is bad ( Pi 



For good channel condition, power is distributed between the two channels depending on the 



VI. Simulation results 
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on both the components. E-dmin is a precoding scheme in which the complex linear precoding matrix is adapted to 
each channel realization, but both the channels are always used (i.e. Ug ~ 2). The modulation alphabet is 16-QAM. 

In Fig.|5] we notice that for schemes, which are fixed and do not adapt with the varying channel, have good error 
performance for small values of (3. The BER performance is however poor with increasing (3. Error performance 
of X-Codes is also seen to deteriorate with increasing (3. The only exception are the Y-Codes and ARITH-MBER. 
For ARITH-MBER with = 1 the opposite is true since it always uses only 1 channel for transmission. The 
performance of Y-Codes is more stable with increasing /3 due to the fact that the codebook is designed in such a 
way to maximize the minimum separation along the first component without caring much about the separation on 
the second component which corresponds to the weak channel. 

It is also observed that both the X-, Y-Precoders appear to adapt well to the changes in the channel. However, 
the Y-Precoders perform better than X-Precoders for /3 > 3, and hence for channels which are badly conditioned 
Y-Precoders would have a better error performance compared to X-Precoder. We shall see later that, indeed for the 
Rayleigh fading channel, Y-Precoders perform better than X-Precoder. 

Therefore, we can conclude that codes which are fixed and do not change with each channel realization would 
have a poor error performance for large values of /3 since they would waste power along the second component 
without any effect on the effective Euclidean distance. In fading channels, /3 can be very large at times. Therefore, 
, a good code is one which adapts to /?. 

We also observe that Y-Codes and Y-Precoders have the best error performance when channel condition is bad. 
This justifies the fact that codes for badly conditioned channels should be designed to have more separation in the 
minimum distance along the component corresponding to the stronger channel. 

B. Diversity order comparison 

We next discuss the diversity order achieved by the various precoding schemes with Rayleigh fading. Let the 
number of subchannels used for transmission be (n^ < n^). The diversity order achieved by the linear precoders 
(EE and ARITH-MBER) and THP is [rir — ris + l){nt — ris + 1) and [nt — Us + 1) respectively, whereas the 
diversity order achieved by E-dmin and X-, Y-Codes is [rir — ^ + 1) (77,4— ^ + 1). The CI scheme achieves infinite 
diversity, but it suffers from power enhancement at the transmitter Among all the other schemes (except CI), we 
observe that E-dmin and X-, Y-Codes have the best diversity order The subsequent simulation results assume a 
Rayleigh fading channel. 

C. Comparison of BER performance with full-rate transmission 

In Fig.|6] we plot the bit error rate (BER) of all precoders for rir ~ rit = ns = 2,4 and a target spectral efficiency 
of 2ns bps/Hz. The proposed X, Y-Precoders and E-dmin have the best error performance. The increased diversity 
order achieved by the pairing scheme is obvious from the higher slope of the error rate for the X, Y-Precoders 
compared to a slope of order 1 for the linear precoder ARITH-MBER and THP. The performance of CI is inferior 
due to enhanced transmit power requirement arising from the bad conditioning of the channel. It is observed that 
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the proposed Y-Precoders perform the best for n,. = rit = 2, with E-dmin only 0.5 dB away at BER of lO^'^. For 
7ir = rit = 4, E-dmin performs better than Y-Precoders by 0.4 dB at BER of 10"'^. However, E-dmin has this 
performance gain at a higher encoding and decoding complexity compared to the Y-Precoder. 

D. Comparison of BER performance for 71^ = fit = 2,4 

In Fig. |7] we plot the BER for rir = nt ^ 2, and a target spectral efficiency of 4, 8 bps/Hz. It is observed 
that the best performance is achieved by the proposed Y-Precoder For a target spectral efficiency of 4 bps/Hz, 
ARITH-MBER also has a similar performance. However, for a spectral efficiency of 8 bps/Hz, the performance of 
ARITH-MBER is worse than that of Y-Precoders by about 2.8 dB at a BER of 10^"^. This is because, to achieve 
higher diversity order, linear precoders do not use all the modes of transmission (i.e. Ug < mm{nr,nt)). Hence, 
to achieve the same target spectral efficiency, they have to use higher order QAM, which results in loss of power 
efficiency. 

In Fig. [8] we plot the BER for = rit = 4, and a target spectral efficiency of 8, 16 bps/Hz. For a target 
spectral efficiency of 8 bps/Hz, E-dmin and ARITH-MBER have the best error performance. Y-Precoders perform 
only about 0.5 dB away at a BER of 10^'^. However, for a target spectral efficiency of 16 bps/Hz Y-Precoders 
perform the best. ARITH-MBER (rig = 2 with 256-QAM modulation on both channels) performs 2.6 dB worse 
than Y-Precoders at a BER of lO^''. E-dmin performs the worst and is about 3.5 dB away from Y-Precoders at 
BER of 10""^. E-dmin has poor performance since the precoder proposed in lfT2l has been optimized for 4-QAM, 
and therefore, it does not perform that well when the target spectral efficiency is higher than 2nt bps/Hz. 

E. X-Codes vs. Y-Codes 

In Fig. |9l we compare the BER performance of the proposed X and Y-Codes for a rir = nt ^ 2 system with 
spectral efficiency of 4, 8 bps/Hz. It is observed that Y-Codes have a significant performance gain over X-Codes. 
For a target spectral efficiency of both 4 and 8 bps/Hz, Y-Codes perform better than X-Codes by about 1.5 dB at 
a BER of 10^"^. This is primarily due to the novel constellation structure of the proposed Y-Codes (as compared 
to the simple rotation encoder for X-Codes),which ensures that the minimal distance between constellation points 
does not become too small when channel is poorly conditioned. In Fig. [TOl we compare the BER performance of 
the proposed X and Y-Codes for a = nt = 4 system with spectral efficiency of 8,16 bps/Hz. Y-Codes again 
perform better than X-Codes by about 0.7 dB for a spectral efficiency of 8 bps/Hz, and by about 1.5 dB for a 
spectral efficiency of 16 bps/Hz. 

F. X-, Y-Codes vs. X-, Y-Precoders 

In this section, we discuss the performance gain achieved by optimally choosing the encoder matrices for 
each channel realization, as compared to having them fixed a priori. 

In Fig. |9l we compare the performance of the X-, Y-Precoders with that of X-, Y-Codes for = rit = 2 with a 
target spectral efficiency of 4, 8 bps/Hz. For a target spectral efficiency of 4 bps/Hz, X-, Y-Precoders perform only 
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marginally better than X-, Y-Codes (by only about 0.2 dB at BER of 10^"^). However, for a target spectral efficiency 
of 8 bps/Hz, the X-Precoder performs better than X-Codes by about 1.0 dB, whereas Y-Precoders perform better than 
Y-Codes by about 0.2 dB at a BER of 10^^. Therefore, changing the encoder matrices with channel realization 
is beneficial for X-Codes. However, it is observed that Y-Precoders do not have as much gain in performance 
compared to Y-Codes. 

For Ur ^ Uf = 4, it is observed from Fig. [T0]that for a target spectral efficiency of 8 bps/Hz, X-, Y-Precoders have 
almost similar performance as X-, Y-Codes. However, for a target spectral efficiency of 16 bps/Hz, X-Precoders 
perform better than X-Codes by about 0.7 dB, whereas Y-Precoders perform better than Y-Codes by about 0.3 dB 
at a BER of lO^^. 

The performance gain of X-Precoders over X-Codes is much more significant as compared to that of the Y- 
Precoders over Y-Codes. Also, for X-Precoders, this performance gain is significant only with higher order QAM. 
This is due to the fact that the error performance is much more sensitive to the rotation angle for higher order 
QAM (see Fig. |2]i, and therefore adjusting the rotation angle with respect to the varying channel is expected to 
result in performance improvement. 

On the other hand Y-Precoders are only marginally better than Y-Codes irrespective of the spectral efficiency. 
This is attributed to the fact that for the Y-Precoders we optimize the upper bound to the probability of error rather 
than the exact error probability. We do this, because of the analytical intractabiUty of the exact error probabiUty 
expression. This leads to a suboptimal choice of the encoder matrices, and therefore a suboptimal error performance. 

This is obvious from Fig.[TTl where we plot the exact optimal word error probability in comparison with the error 
probability of the proposed suboptimal Y-Precoder The exact optimal word error probability (i.e, error probabiUty 
with the optimal choice of encoder matrices) is computed through Monte Carlo techniques using ( |53]) and the 
integrals in ( |54] |. The exact optimal error probability is better than the proposed suboptimal Y-Precoder by about 
1.8 dB for a n,. ~ nt = 2 system, and is better by about 1.0 dB for a rir ^ rit = 4 system at a word error probability 
of 10^^. This therefore suggests the existence of better Y-Precoders compared to what has been proposed in this 
paper 

VII. Complexity 

In this section, we discuss the computational complexity of X-, Y-Codes and compare it with other precoding 
schemes. The linear precoders (ARJTH-MBER and EE), E-dmin and X-Codes need to compute the SVD 
decomposition of H. The CI and THP schemes involve computing the pseudo-inverse and QR decomposition 
of H respectively. The complexity of computing SVD, QR as well as pseudo-inverse is 0{nf.). These operations 
are computation intensive. However, TDD is generally employed in a slowly fading channel, and therefore these 
computations can be performed once, and can be used until the channel changes. We, therefore, do not consider 
the complexity of these decompositions in the discussion below. 
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A. Encoding Complexity 

The encoding complexity of all the schemes is 0{nrnt), which is due to the transmit preprocessing filter. If 
the number of operations were to be computed, CI and X-, Y-Codes would have the lowest complexity. This is 
so because, linear precoders need to compute an extra pre-processing matrix (in addition to SVD). THP also has 
to do successive interference pre-cancelation (in addition to QR). On the other hand, E-dmin and X-, Y-Codes 
need to only compute SVD, which automatically gives the pre-processing and the post-processing matrices. Also, 
X-, Y-Codes have lower encoding complexity compared to E-dmin, because the encoding matrices A^. are real, as 
opposed to being complex for E-dmin. CI has an even lower complexity since there is no spatial coding. 

B. Decoding Complexity 

The decoding complexity of all the schemes have a square dependence on rir- This is due to the post-processing 
matrix filter at the receiver. The linear precoders, CI and THP employ post processing at the receiver, which 
enables independent ML decoding for each subchannel. With QAM modulation symbols this is only a rounding 
operation for each subchannel. E-dmin and X-Codes on the other hand use sphere decoding to jointly decode 
pairs of subchannels. ML decoding for X-Codes is accomplished by using 2-dimensional real sphere decoders. 
However, E-dmin requires ^f■ 4-dimensional real sphere decoders. The average complexity of sphere decoding is 
cubic in the number of dimensions (and is invariant w.r.t modulation alphabet size M) ITJI , and therefore X-Codes 
have a much lower decoding complexity when compared to E-dmin. The ML decoding complexity of Y-Codes is 
independent of M, and is equal to the ML decoding complexity of a scalar channel. Therefore, the linear precoders, 
CI, THP and Y-Codes have the lowest ML decoding complexity among the considered preceding schemes. 

VIII. Conclusion 

We proposed X-, Y-Codes/Precoders which can achieve full-rate and high diversity at low complexity by pairing 
the subchannels prior to SVD preceding. It is observed that indeed pairing of channels can significantly improve 
the overall diversity. Among all possible pairings, pairing the fc-th channel with the (n^ — fc + l)-th subchannel 
was found to be optimal in terms of achieving the best diversity order. One way of pairing the subchannels is by 
using rotation based encoding as for X-Codes/Precoders. The proposed X-Codes/Precoders have good performance 
for well conditioned channels. For ill-conditioned channels, we then proposed Y-Codes/Precoders. It is shown by 
simulation and analysis that Y-Codes/Precoders achieve the best error performance at very low complexity, when 
compared to other precoders in the literature. In practice, in order to improve the overall performance, it is possible 
to adaptively switch between X- and Y-Codes/Precoders depending on the channel conditions. 

Appendix A 
Proof of Theorem 1 

Towards proving Theorem 1, we shall find the following Lemma useful (See proposition 1 in ifTTl for the same 
lemma and its proof). 
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Lemma 1: Given a real scalar channel modeled by y = ^Jax + n, where x = ±^/E^, n ^ Af{Q,a'^). Let 
F{a) = Ca'^ + o(a^), for a ^ 0+ be the cdf (cumulative density function) of a, where C is a constant and k is 
a positive integeo Let 7 = Eg/a"^ be the SNR. Then the probability of error is given by ^(7) = '&a[Q{\/orj)]- 
The lemma states that the asymptotic error probability for 7 cx) is given by 



P = 



C((2fc- 1) • (2fc-3) 



5 • 3 • 1) 



-7-^ + 



0(7" 



The proof of Theorem 1 is as follows. Let {3fi(ufe) 3?(vfc)} denote the pairwise error event that, given u^. was 
transmitted on the fc-th pair, the real part of the ML detector for the fc-th pair decodes in favor of some other vector 
5R(vfc). Further, let us denote the probability of this event by Pj,(5R(ufc) — > 5R(vfe)) (pairwise error probability 
(PEP)). Using the union bounding technique, Pj,(5R(ufc)) is then upper bounded by the sum of all the possible 
pairwise error probabilities. From (ISTT i it is clear that this upper bound on P^(3fJ(ufc)) induces an upper bound on 
Pj. which is given by 

^ ^ E E ^ 3?K.)) (60) 

Due to Gaussian noise, this can be further written as 



n < 



15, 



oE E 



5R(ufc)K(vfc)7^!R(uk) 



where 



^d^(jR(ufc),jR(vfc),Afc 
2iVn 



Mfc(5RK)-5RK.))||2 



(61) 



4(5R(ufe),5RK.),Afc) 
Q{x) 



The expectation in (I6TI 1 is over the joint distribution of the channel gain (A^^ , Xj^)- The joint pdf (probability density 
function) of the ordered eigenvalues of H^H is given by the well known Wishart distribution lfT4l . However, in 
(|6TT i evaluating the expectation over (A^^. , ) is still a difficult problem except for trivial cases (like Ur ~ Uf ^ 2). 
We therefore try to bound (i^(5J(ufc), 5R(vfc), A^) such that the bound depends only on A^^. Since A^^. > Xj^ > 0, 
using the definition of M and M^, we have 



dlmuk),m^k),Ak) > A? 4(sR(u^,)^sR(v,),Afe) 



(62) 



where 



ek 



Afc(3?K)-3?(vfc)) 



^Any function f{x) in a single variable x is said to be o{g(x)) i.e. f{x) = o(g{x)) if .^'s ^ as x — ► 
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and we let Ck.i denote the first component of the 2-dimensional vector et- We further define the generalized 
minimum distance as follows : 

gkiAk)= mill dlmuk),^iwk),Ak) (63) 

SR(ufc)#3?(vfc) 

Since Q( ) is a monotonically decreasing function with increasing argument, we can further upper bound (l6Tl i using 
(l63l l as follows : 



P, < i\Sk\ - 1)E 




A-,gfc(Afc) 



(64) 



2No 

For a Rayleigh faded channel, the marginal pdf of the s-th eigenvalue (for 0) is given by |[T6l 

f{Xl) = C(,s)(A2)^'(^)^'-(^)-i + o((A2)^'(^)^'-(^)-i) (65) 

where Nt{s) = [nt — s + 1), Nr{s) = {rir — s + 1) and C(s) is a constant given in |[T6l . Using the pdf in ( l65T l. 
the cdf = P(As < u) (for u 0+) is given by 

F,(u) = + o(7.^'(^)^-(^)) (66) 



where -D(.s) =^ WTIWTI- Using Lemma [T] and ( l66b . the bound in ( l64l i can be further written as 



C{s) 



T 

where Sk = (nt - ik + l)("r - «fc + 1) and Ck 



P;<(|5.|-1H(^)"Vo(,-^) (67) 



2Sk 



Appendix B 
Proof of Theorem 2 

Let d{9k, Xi^^, Xj^) = min(p ,j)gs, d1{p,q,dk), where df.{p,q,9k) is defined in ( l40l i. The objective is to find the 
optimal 9k which maximizes d{9k, Xi^, Xj^)- The set §2 contains exactly 8 elements. Also due to sign symmetry 
of this set ( i.e. if {p, q) e S2 then so do (p, — g), (— p, q) and (— p, — g)), there are actually only 4 distances to be 
computed. For a given angle 9k these distances are enumerated as follows : 

di = Xl cos^ (Ok) + Xlsm^ {9k) 

d2 = XlW{9k) + X]^cos''{9k) 

d3 = Xl icos{9k) + sin(0fe))2 + A^^ (cos((?fc) - sin(0fc))2 

di = Xl{cos{9k) - MSk))'' + Xl{cos{ek) + sm{ek)f 

Therefore, d{9k, Xi^, Xj^) can be expressed in terms of these distances as 

d{9k,Xik^^jk) = min(di,d2,rf3,c'4) 

= min(min((ii, ^2)7 min((i3, ^4)) 
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Just as in Appendix [A) it can be shown that for the maximization in ( [39] l. it suffices to only consider the range of 
Ok to be [0, 7r/4]. Also it is given that A^j. > Xj^ due to the order of the singular values in the SVD decomposition 
and the way in which pairing is done for X-Codes. Using these two facts, it can be concluded that 

d2 = min(di, (i2) 
di ~ min((i3, ^4) 

and therefore 

d{9k , Xt^ , Xj^ ) = min(d2 , ^4) 

Let 

o A Xi^ 
Pk = T — 

Then, ^4 is the minimum if the following condition is satisfied. 



where 



4 < HOk) (68) 



sin(26'fe) +sin^(6lfc) 

It can be seen that over the interval (0 j], h{9k) is a continuous and monotonically increasing function. This is 
because the first derivative w.rt 6k exists and is always positive. The maximum value of h{9k) over this interval is 
i. Therefore, we now consider two situations depending upon whether (3k is greater than or less than -y/S- 

If /3fc < V3, then ji > \. Since h{6k) is always less than i, we can conclude that the condition in ( |68] ) is never 
satisfied and therefore ^2 is the minima. Further since A^^. > Ajj., ^2 is a monotonically increasing function of 6k 
and therefore the solution to the maximization problem in ( [39] l is 

If Pk > "v/S, then -4- < i. Since h{6k) is a monotonically increasing function we observe that ^4 is the minima 

Pk 

when 6k > 6^ or else ^2 is the minima. Here 6^ is such that 0^ G [0 |-] and h{dl) — -4-. Further, it is observed 

' k 

that d2 is a monotonically increasing function of 6k whereas ^4 is monotonically decreasing. Also, d-z ~ d4 when 

dk = 61. Therefore, it can be concluded that min((i2,c?4) is maximized when dk ~ 61. Hence for Pk > the 

solution to the maximization problem in (|39] | is 6^. We now solve for 6^. Using the definition of h{dk), we have 

1 1 
1 



2 „;„.o/)*^ , sin2 



/3fc siii(26i 



'kJ 



or equivalently, 

tan2(0*)-2(/32-l)tan(0*)+/3fc = O 
The last equation is quadratic in tan(0^), and the solution which results in 6^ G [0 j] is given by 



tan ^ 



Combining, the optimal angles obtained for Pk < and (3k > \/3, we get the solution to the maximization 
problem as stated in dTTl i. ■ 
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Appendix C 
Proof of Theorem 3 

We first get an expression for df, jjjin(afe, ^fe) as defined in ( |56] |. For any code vector at index v which is even, 
the nearest distance to any other code vector with even index is AXf^al- The nearest distance to any code vector at 
odd index is Af^a^ + 4A|^5^. The same holds true if v is odd. Hence niin(^'^' ^k) is given by 

dUnMk,b,) = min {4X1^, Xlal + AXlbl) (69) 

Therefore, our objective is to solve the following constrained min-max optimization problem. 

{al,bl) = arg max mm Uxlal Xlal + AXlbl) (70) 

In dTOl i. we let T\ = 4Af a|, which is geometrically a straight line w.rt. a| passing through the origin and attaining 



a maximum value of 



at 



48PrAf, 

nr(M2 - 1 



2 12i\ 



This is due to the transmit power constraint 



In (|70|, we let T2 ^ (^\a\ + ^X]h\). Since 



nr(Af2 - 1) 

- n,(Af2-l) 
2 (A.f2 - 1) _ 



+ 12 
we can express T2 as 

From dTTI ). we observe that, if /3| > (A/2 — l)/3, then r2 is a straight line with positive slope, with a value of 
4A2^ Pt jn^ at afc = and attaining a maximum value of 

12PTAf^ 



nr(M2 - 1) 

at 

2 ^ 12Pt 
n,(Af2-l) 

This maximum value is less than the maximum attained by T-^. Since both Ti and T2 have positive slope the 
minimum among T\ and T2 is maximized at a^." = 12PT/{nr{M'^ — 1)), which implies that hi = 0. The value of 
df. jj,in(afc, fefc) at flfc = a^. is the maximum value attained by T2. Therefore, when the channel condition exceeds a 
certain threshold, it is optimal to allocate all power to the stronger channel only. 
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On the other hand, if < , then T2 is a straight Hne with negative slope, whereas Ti has positive slope, 

and therefore the minimum between them is maximized when they are both equal. Therefore, the optimal (a^, 5^.) 
must satisfy 

AXlal' ^ Xlal' + iXlbf (72) 
Using the fact that b^. + ^^j^^ = the optimal (a|;, h\) is given by 

{al^bl) = I , " , ^ I (73) 




9 

Using ( 173] ), the optimal value of d\ min(afc, for < ^^-^ is given by 
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Figure Captions: 

1) Union bound for word error probability, n.,. = nt — 2 and M = 2 (4-QAM) modulation. 

2) Sensitivity of word error probability w.r.t Oi. n,. = nt ^ 2 and A/ = 2,4 (4,16-QAM) modulation. 

3) One quadrant of the set Sm for M = 2,4 (4,16-QAM modulation). The critical angles where performance 
degrades severely are shown to coincide with tan^^(— p/q). 

4) Received signal space for the real component of the fc-th pair. AI = 8 and therefore we have 5 regions 
with vertical dashed lines demarcating the boundary between the regions. The scaled codebook vectors 
are represented by small filled circles along with their corresponding codebook index number Dotted lines 
demarcate the boundary between the ML decision regions. 

5) Effect of the channel condition number on error performance of various precoders for a 2 x 2 system with 
target spectral efficiency of 8bps/Hz. 

6) BER comparison between various precoders for rir = nt = ng = 2,4 and M = 2 (4-QAM) modulation. 
Target spectral efficiency is equal to 2ns bps/Hz. 

7) BER comparison between various precoders for nr = nt ~ 2 and target spectral efficiency of 4, 8 bps/Hz. 

8) BER comparison between various precoders for = = 4 and target spectral efficiency of 8, 16 bps/Hz. 

9) BER comparison between the proposed X-Codes and Y-Codes for n,. ~ nt = 2 with spectral efficiency = 4, 
8 bps/Hz. 

10) BER comparison between the proposed X-Codes and Y-Codes for = = 4 with spectral efficiency = 8, 
16 bps/Hz. 

11) Word error probabiUty comparison between the proposed suboptimal Y-Precoders and exact optimal Y- 
Precoders for = rit = 2,4 with spectral efficiency of Ant bps/Hz. 
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y(dB) 



Fig. 1. Union bound for word error probability, rir = nt = 2 and M = 2 (4-QAM) modulation. 




45 



Angle - 6, (Deg) 



Fig. 2. Sensitivity of word eiTor probability w.rt 6i. n,. = rit = 2 and M = 2,4 (4,16-QAM) modulation. 
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Fig. 4. Received signal space for the real component of the fe-th pair. M = 8 and therefore we have 5 regions with vertical dashed lines 
demarcating the boundary between the regions. The scaled codebook vectors are represented by small filled circles along with their corresponding 
codebook index number. Dotted lines demarcate the boundary between the ML decision regions. 
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Channel gain = A^) 




Condition number (P) 



Fig. 5. Effect of the channel condition number on en'or performance of various precoders for a 2 X 2 system with target spectral efficiency 
equal to 8bps/Hz. 




Fig. 6. BER comparison between various precoders for Ur = nt = Us = 2, A and M = 2 (4-QAM) modulation. Target spectral efficiency is 
equal to 2ns bps/Hz. 
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Fig. 7. BER comparison between various precoders for Ur = nt = 2 and target spectral efficiency of 4, 8 bps/Hz. 
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BER comparison between various precoders for rir = nt = 4 and target spectral efficiency of 8, 16 bps/Hz. 
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Spectral Efficiency = 4 bps/Hz 



- -o- - X-Codes (4 bps/Hz) 

*- - Y-Codes (4 bps/Hz) 

X-Precoder (4 bps/Hz) 

a Y-Precoder (4 bps/Hz) 

e X-Codes (8 bps/Hz) 

» Y-Codes (8 bps/Hz) 

X-Precoder (8 bps/Hz) 

Y-Precoder (8 bps/Hz) 



Spectral Efficiency = 8 bps/Hz 



10 



15 



20 
Y(dB) 



25 



30 



35 



40 



Fig. 9. BER compai'isoii between the proposed X-Codes and Y-Codes for rir 
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Fig. 10. BER comparison between the proposed X-Codes and Y-Codes for = nt = 4 with spectral efficiency of 8, 16 bps/Hz. 
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Fig. 11. Word error probability comparison between the proposed suboptimal Y-Precoders and exact optimal Y-Precoders for n,- = nt = 2, 4 
with spectral efficiency of Ant bps/Hz. 
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